Abstract

Call a function $f : \mathbb{F}_2^n \to \{0,1\}$ odd-cycle-free if there are no $x_1, \dots, x_k \in \mathbb{F}_2^n$ with $k$ an odd integer such that $f(x_1) = \cdots = f(x_k) = 1$ and $x_1 + \cdots + x_k = 0$. We show that one can distinguish odd-cycle-free functions from those $\epsilon$-far from being odd-cycle-free by making $\mathrm{poly}(1/\epsilon)$ queries to an evaluation oracle. To obtain this result, we use connections between basic Fourier analysis and spectral graph theory to show that one can reduce testing odd-cycle-freeness of Boolean functions to testing bipartiteness of dense graphs. Our work forms part of a recent sequence of works that shows connections between testability of properties of Boolean functions and of graph properties.

We also prove that there is a canonical tester for odd-cycle-freeness making $\mathrm{poly}(1/\epsilon)$ queries, meaning that the testing algorithm operates by picking a random linear subspace of dimension $O(\log 1/\epsilon)$ and then checking whether the restriction of the function to the subspace is odd-cycle-free. The test is analyzed by studying the effect of random subspace restriction on the Fourier coefficients of a function. Our work implies that testing odd-cycle-freeness using a canonical tester instead of an arbitrary tester incurs no more than a polynomial blowup in the query complexity. The question of whether a canonical tester with polynomial blowup exists for all linear-invariant properties remains an open problem.

Keywords: property testing, Boolean functions, Fourier analysis, Cayley graphs, eigenvalues

1 Introduction

A property testing algorithm is required to distinguish objects that satisfy a given property from 
objects that are "far" from satisfying the property. Property testing as an area of study has two distinct origins: [BLR93] (and subsequently [RS96]), which formally investigated the testability of linearity and other properties of Boolean functions, and [GGR98], which studied the testability of various graph properties. Although [GGR98] was inspired by the preceding
work on Boolean functions, the two directions evolved more or less independently in terms of the 
themes considered and the techniques employed. Recently though, this has dramatically changed, 
and quite a few surprising connections have emerged. In this work, we draw more connections 
between these two apparently different areas and show how ideas and tools used in the study of 
graph properties can be used to test certain properties of Boolean functions. 
We start with a few definitional remarks. A property of Boolean functions is a subset $\mathcal{P} \subseteq \{f : \{0,1\}^n \to \{0,1\}\}$. The distance between $f, g : \{0,1\}^n \to \{0,1\}$ is given by the Hamming metric $\delta(f,g) = \Pr_x[f(x) \neq g(x)]$, and the distance from $f$ to $\mathcal{P}$ is $\delta_{\mathcal{P}}(f) = \min_{g \in \mathcal{P}} \delta(f,g)$. A function $f$ is $\epsilon$-far from $\mathcal{P}$ if $\delta_{\mathcal{P}}(f) \geq \epsilon$. These definitions carry over to graph properties^1, where the distance to a graph property $\mathcal{P}$ is said to be $\epsilon$ if $\epsilon n^2$ edges need to be added to or removed from the given graph on $n$ vertices in order to obtain a graph in $\mathcal{P}$. A tester for $\mathcal{P}$ is a randomized algorithm which, given oracle access to the input and a parameter $\epsilon \in (0,1)$, accepts with probability at least 2/3 when the input is in $\mathcal{P}$ and rejects with probability at least 2/3 when it is $\epsilon$-far from $\mathcal{P}$. In the case of Boolean functions, the tester can query the value of the function at any element of $\{0,1\}^n$, and in the case of graphs, it can query the adjacency matrix at any location. The complexity of a tester is measured by the number of queries it makes to the oracle, and if this quantity is independent of $n$, the property is called (strongly) testable. A one-sided error tester should accept every object in $\mathcal{P}$ with probability 1 and reject every object that is $\epsilon$-far from $\mathcal{P}$ with probability at least 2/3.

Our main focus in this paper is the study of the following property of Boolean functions. 

Definition 1 (Odd-Cycle-Freeness) A function $f : \mathbb{F}_2^n \to \{0,1\}$ is said to be odd-cycle-free (OCF) if for all odd $k \geq 1$, there are no $x_1, x_2, \dots, x_k \in \mathbb{F}_2^n$ satisfying $x_1 + \cdots + x_k = 0$ and $f(x_i) = 1$ for all $i \in [k]$.

The name "odd-cycle-freeness" arises from the observation that $f$ is OCF if and only if the Cayley graph^2 with the support of $f$ as its generators is free of cycles of odd length, i.e., is bipartite. The property of bipartiteness in general graphs has been extensively studied, and nearly tight upper and lower bounds are known for its query complexity [GGR98, AK02, BT04, KKR04]. In this work, we show that odd-cycle-freeness for Boolean functions is testable with comparable query complexity and, moreover, using tests that are very similar to the ones for graph bipartiteness.

Odd-cycle-freeness can also be described in a more algebraic way. As observed in Section 2, given a function $f : \mathbb{F}_2^n \to \{0,1\}$ with density $\rho = \mathbb{E}_x[f(x)]$, the distance of $f$ to OCF is exactly $\frac{1}{2}\left(\rho + \min_\alpha \hat{f}(\alpha)\right)$. So, a Boolean function's distance to OCF is directly connected to the (signed) value of its smallest Fourier coefficient. This link proves crucial in our analysis of tests for OCF.

Our work is part of a larger program to understand the structure of testable properties of 
Boolean functions. We explain this perspective next. 

Common themes in testing A leading question in the search for common unifying themes in property testing has been that of discovering necessary and sufficient conditions for strong testability. Kaufman and Sudan [KS08] suggest that linear invariance is a natural property of Boolean functions and plays an important role in testing. Formally, a property^3 $\mathcal{P} \subseteq \{f : \mathbb{F}_2^n \to \{0,1\}\}$ is said to be linear-invariant if for any $f \in \mathcal{P}$, it is also the case that $f \circ L \in \mathcal{P}$ for any $\mathbb{F}_2$-linear transformation $L : \mathbb{F}_2^n \to \mathbb{F}_2^n$. Some notable examples of properties that were shown to be testable and which are invariant under linear transformations of the domain include linear functions [BLR93], low degree polynomials [AKK+05], and functions with low Fourier dimensionality or sparsity [GOS+09].

^1 In this paper, when we refer to graph properties, we will always mean the dense graph model where the graph is represented by its adjacency matrix.

^2 See Section 3 for the precise definition.

^3 Henceforth, we will identify $\{0,1\}^n$ with the vector space $\mathbb{F}_2^n$.
A general class of linear-invariant families can be described in terms of forbidden patterns. The first instance of this perspective appeared in the work of Green [Gre05] on testing whether a Boolean function is triangle-free. Formally, $f$ is triangle-free if it is free from the pattern $(f(x), f(y), f(x+y)) = (1,1,1)$ for any $x, y \in \mathbb{F}_2^n$. More generally, a function is said to be free from solutions to the linear equation $x_1 + \cdots + x_k = 0$ if there are no $x = (x_1, \dots, x_k) \in (\mathbb{F}_2^n)^k$ satisfying $x_1 + \cdots + x_k = 0$ and $f(x_i) = 1$ for all $i \in [k]$. Pushing this generalization further, for a matrix $M \in \mathbb{F}_2^{m \times k}$ and $\sigma \in \{0,1\}^k$, we say that $f$ is $(M, \sigma)$-free if there is no $x = (x_1, \dots, x_k) \in (\mathbb{F}_2^n)^k$ such that $Mx = 0$ and $f(x_i) = \sigma_i$ for all $i \in [k]$. This corresponds to freeness from solutions to a system of linear equations. When $\sigma = 1^k$, notice that if $f$ belongs to the resulting property $\mathcal{P}$, then any function obtained from $f$ by removing elements in the support of $f$ also must belong to $\mathcal{P}$, and in this case $\mathcal{P}$ is said to be monotone.

Green [Gre05] proved that $(M, 1^k)$-freeness is testable with one-sided error when $M$ is a rank 1 matrix. Král', Serra and Vena [KSV11] and Shapira [Sha09] showed that this is true regardless of the matrix $M$. The setting when $\sigma \neq 1^k$ was introduced in [BCSX11], where it is shown that $(M, \sigma)$-freeness is testable for any $\sigma$ as long as $M$ is of rank one. The case of $M$ being rank 1 was fully resolved by Bhattacharyya, Grigorescu and Shapira [BGS10], who showed that any (possibly infinite) intersection of such properties is also testable. Now, note that odd-cycle-freeness is an example of such a property; it is the intersection of $(C_k, 1^k)$-freeness for all odd $k \geq 1$, where $C_k$ is the row vector $[1\ 1\ \cdots\ 1]$ of size $k$. We next state this result formally.

Theorem 2 ([BGS10]) There exists a function $f : (0,1) \to \mathbb{Z}^+$ such that the following is true. For any $\epsilon > 0$, there is a one-sided tester with query complexity $f(\epsilon)$ that distinguishes OCF functions from functions $\epsilon$-far from OCF.

In fact, odd-cycle-freeness is not just an "example" of a property shown to be testable by [BGS10]: any monotone property that can be expressed as freeness from solutions to an infinite set of linear equations is equivalent to the odd-cycle-freeness property (see Section 5 for the short argument). Thus, we view the study of odd-cycle-freeness as an important step towards understanding the testability of monotone linear-invariant properties.

Surprisingly, a similar picture has emerged in the world of testable properties in the dense graph model. Just as for Boolean functions, triangle-freeness, which was shown (implicitly) by Ruzsa and Szemerédi [RS78] to be testable, opened up a rich perspective in the area. It was followed by exciting results in testing $H$-freeness [ADL+94] and induced $H$-freeness [AFKS00], which are somewhat analogous to the results on testing monotone and non-monotone properties of Boolean functions. This direction culminated with a nearly complete characterization of all properties that are testable with one-sided error and a constant number of queries [AS08].

The proofs in [BGS10] draw heavily on the ideas used in characterizing general graph properties, such as the Szemerédi regularity lemma and Ramsey theory, and just like in that case, the query complexity bounds obtained are a tower of exponentials. Thus, the bound obtained in [BGS10] for $f(\epsilon)$ in Theorem 2 above is an embarrassingly weak tower of exponentials. This brings up the question of characterizing the class of linear-invariant properties which can be tested with $\mathrm{poly}(1/\epsilon)$ queries. In this work, we show that odd-cycle-freeness belongs to this class of properties by way of two different proofs, each with its own message.

1.1 The Edge-Sampling Test 

Our first approach for testing OCF relies on reducing testing OCF in Boolean functions to testing bipartiteness of the Cayley graph associated with the function. More precisely, we will show that the following algorithm is a tester for OCF.

Edge-sampling test (Input: oracle access to $f : \mathbb{F}_2^n \to \{0,1\}$)

1. Uniformly pick $\alpha_1, \dots, \alpha_k \in \mathbb{F}_2^n$, where $k = O(1/\epsilon)$. Let $G = \{\alpha_i - \alpha_j : i < j\}$.

2. Accept if and only if the restriction of $f$ to $G$ is odd-cycle-free.

Theorem 3 The edge-sampling test is a one-sided tester for odd-cycle-freeness with query complexity $O(1/\epsilon^2)$.
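The two steps of the edge-sampling test can be sketched as follows. This is an illustrative sketch only, under several assumptions that are not taken from the paper: elements of $\mathbb{F}_2^n$ are encoded as Python integers (bitmasks), so subtraction is XOR; the sample size `k` is passed explicitly because the constant hidden in $k = O(1/\epsilon)$ is not specified here; and the helper names `has_no_odd_cycle` and `edge_sampling_test` are hypothetical. The odd-cycle check uses Gaussian elimination over $\mathbb{F}_2$ on vectors augmented with a parity bit, which is one convenient way to decide the condition, not necessarily the one the authors have in mind.

```python
import random

def has_no_odd_cycle(points):
    """Return True iff no odd number of `points` XORs to zero.

    Each point x is augmented with a parity bit: (x << 1) | 1. An odd
    number of points summing to zero is the same as the value 1 (zero
    vector, parity bit set) lying in the span of the augmented vectors.
    """
    basis = {}  # leading-bit position -> reduced augmented vector
    for x in points:
        v = (x << 1) | 1
        while v:
            if v == 1:
                return False  # an odd cycle exists
            lead = v.bit_length() - 1
            if lead not in basis:
                basis[lead] = v
                break
            v ^= basis[lead]
    return True

def edge_sampling_test(f, n, k, rng=random):
    """One run of the edge-sampling test; returns True to accept."""
    a = [rng.getrandbits(n) for _ in range(k)]
    # Over F_2, a_i - a_j equals a_i XOR a_j.
    queried = {a[i] ^ a[j] for i in range(k) for j in range(i + 1, k)}
    ones = [x for x in queried if f(x) == 1]
    return has_no_odd_cycle(ones)
```

For instance, the linear function `lambda x: x & 1` is OCF and is always accepted, while the constant-one function is rejected as soon as `k >= 3`, since the three pairwise differences of any three sample points sum to zero.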

For a function $f : \mathbb{F}_2^n \to \{0,1\}$, define the Cayley graph $\mathcal{G}(f) = (V, E)$ to be the graph with vertex set $V = \mathbb{F}_2^n$ and edge set $E = \{(x,y) \mid f(x - y) = 1\}$. It is not hard to show that $f$ is far from odd-cycle-free if and only if $\mathcal{G}(f)$ is far from any bipartite Cayley graph. The crux in the analysis of the above testing algorithm involves showing that if $f$ is $\epsilon$-far from being odd-cycle-free, then its Cayley graph $\mathcal{G}(f)$ is $\epsilon/2$-far from any bipartite graph. The proof relies on the well-known fact that the eigenvalues of the adjacency matrix of $\mathcal{G}(f)$ are exactly the Fourier coefficients of $f$, and uses spectral techniques to analyze the distance to bipartiteness. Finally, the test emulates the test for bipartiteness of [AK02].

In fact, we prove something stronger: the distance of $\mathcal{G}(f)$ from bipartiteness is exactly half the distance of $f$ from OCF. Using the fact [GGR98, AdlVKK03] that one can estimate a graph's distance from bipartiteness using $\mathrm{poly}(1/\epsilon)$ queries to within additive error $\epsilon$, it follows that one can estimate the distance of $f$ from OCF.

Theorem 4 There exists an algorithm that, given oracle access to a function $f : \mathbb{F}_2^n \to \{0,1\}$ and a parameter $\epsilon \in (0,1)$, makes $\mathrm{poly}(1/\epsilon)$ queries and returns the distance of $f$ to OCF to within an error of $\pm\epsilon$. The same holds for approximating $\min_\alpha \hat{f}(\alpha)$ to within an error of $\pm\epsilon$.

The second statement is because of the connection between the distance to OCF and Fourier coefficients mentioned earlier. Using the above, we also obtain a $\mathrm{poly}(1/\epsilon)$-query algorithm to approximate distance to linearity that seems different from previously known ones [BLR93, PRR06].

1.2 The Subspace Restriction Test 

Call a tester for a graph property $\mathcal{P}$ canonical if it works by picking a set of vertices uniformly at random, querying all the edges spanned by these vertices and checking whether the induced graph satisfies $\mathcal{P}$ or not. [Alo02, GT03] showed that if $\mathcal{P}$ is a hereditary graph property (i.e., if a graph $G$ satisfies $\mathcal{P}$, then so does every induced subgraph of $G$), then $\mathcal{P}$ can in fact be tested using a canonical tester with only a quadratic blowup in the query complexity. Moreover, for many natural hereditary graph properties, and in particular, for the property of graph bipartiteness, there is asymptotically no loss in using a canonical tester. The existence of a canonical tester also makes it convenient to prove lower bounds for hereditary graph properties. It is thus natural to ask if a similar theorem can be proved in the context of testing properties of Boolean functions.

Suppose $\mathcal{P}$ is a subspace-hereditary linear-invariant property of Boolean functions, meaning that if a function $f : \mathbb{F}_2^n \to \{0,1\}$ satisfies $\mathcal{P}$, then so does $f$ restricted to any linear subspace of the domain. Subspace-hereditary properties are especially interesting because they include most natural linear-invariant properties and are conjectured in [BGS10] to be exactly the class of one-sided testable linear-invariant properties (modulo some technicalities). Now, just as a canonical tester for a hereditary graph property works by sampling a set of vertices $S$ and querying all edges induced in $S$, one defines a canonical tester for a subspace-hereditary property $\mathcal{P}$ to be the algorithm that, on input $\epsilon \in (0,1)$ and oracle access to $f : \mathbb{F}_2^n \to \{0,1\}$, chooses uniformly at random a linear subspace $H$ of dimension $d(\epsilon, n)$ (for some fixed function $d : (0,1) \times \mathbb{Z}^+ \to \mathbb{Z}^+$) and accepts if and only if $f$ restricted to $H$ satisfies $\mathcal{P}$. The query complexity of the canonical tester is $2^{d(\epsilon,n)}$, the size of the subspace inspected by the tester. It is shown in [BGS10], using ideas similar to those in [Alo02, GT03], that any tester for a subspace-hereditary linear-invariant property can be converted to canonical form, but at the expense of an exponential blowup in the query complexity. The question that arises then is whether, instead of an exponential blowup, only a polynomial blowup in the query complexity is always possible.

Question 5 Given a subspace-hereditary property $\mathcal{P}$ that can be tested with $q$ queries, is there always a canonical tester of complexity $\mathrm{poly}(q)$?

This seems to be a hard question in general. However, in this work, we show that for the property 
of odd-cycle-freeness, the answer to Question 5 is affirmative. (Note that the edge-sampling test is 
not canonical.) 



Subspace restriction test (Input: oracle access to $f : \mathbb{F}_2^n \to \{0,1\}$)

1. Uniformly pick $\alpha_1, \dots, \alpha_k \in \mathbb{F}_2^n$, where $k = O(\log \frac{1}{\epsilon})$. Let $H$ be the linear subspace spanned by $\alpha_1, \dots, \alpha_k$.

2. Accept if and only if the restriction of $f$ to $H$ is odd-cycle-free.



Theorem 6 The subspace restriction test is a one-sided tester for odd-cycle-freeness with query 
complexity 0{l/e'^^). 
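The subspace restriction test admits a similarly short sketch. As before, this is only an illustration under assumptions that are not taken from the paper: vectors are bitmask-encoded integers, the number of generators `k` is an explicit argument (the constant in $k = O(\log 1/\epsilon)$ is left open), and the names `span`, `restriction_is_ocf` and `subspace_restriction_test` are hypothetical. The test queries $f$ on all of $H$, which has at most $2^k$ elements.

```python
import random

def has_no_odd_cycle(points):
    """True iff no odd number of `points` XORs to zero (parity-bit elimination)."""
    basis = {}
    for x in points:
        v = (x << 1) | 1          # append a parity bit to each vector
        while v:
            if v == 1:            # zero vector reached with odd parity
                return False
            lead = v.bit_length() - 1
            if lead not in basis:
                basis[lead] = v
                break
            v ^= basis[lead]
    return True

def span(generators):
    """All F_2-linear combinations of the generators."""
    elems = {0}
    for g in generators:
        elems |= {v ^ g for v in elems}
    return elems

def restriction_is_ocf(f, generators):
    """Query f on the whole subspace and check odd-cycle-freeness there."""
    return has_no_odd_cycle([x for x in span(generators) if f(x) == 1])

def subspace_restriction_test(f, n, k, rng=random):
    """One run of the subspace restriction test; returns True to accept."""
    gens = [rng.getrandbits(n) for _ in range(k)]
    return restriction_is_ocf(f, gens)
```

With generators `[1, 2]` and the indicator of the nonzero vectors, the restriction contains the odd cycle $1 + 2 + 3 = 0$ and is rejected, whereas any linear function is accepted on every subspace.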

The analysis of the subspace-restriction test relies on a Fourier analytic argument. One can 
easily see that the test accepts every OCF function. The main insight is that certain properties 
of the Fourier spectrum of a function that is e-far from being OCF are preserved under random 
restrictions to small subspaces. 

Note that Theorem 6 implies the combinatorial fact that for any function $f$ that is $\epsilon$-far from OCF, there must exist a short witness to this fact. That is, there must exist $x_1, \dots, x_k \in \mathrm{supp}(f)$ with $x_1 + \cdots + x_k = 0$ and $k = O(\log 1/\epsilon)$ an odd integer. In fact, Theorem 6 asserts that there must exist many such witnesses, but a priori, it is not clear that even one such witness exists. This is in contrast to properties such as triangle-freeness studied in [Gre05], where witnesses to violations of triangle-freeness are, by definition, short, and for testability, one "only" needs to show that there exist many such witnesses.

1.3 Organization 

The rest of the paper is organized as follows. In Section 2 we show that one can relate the distance 
of / from OCF to the Fourier expansion of /. In Section 3 we use this relation together with 
some results from spectral graph theory in order to analyze the edge-sampling test and thus prove
Theorem 3. We also show in this section that a strengthening of the analysis for the edge-sampling 
test yields a distance estimator. In Section 4, we analyze the subspace restriction test and prove 
Theorem 6. Finally, Section 5 contains some concluding remarks and a discussion of some open 
problems related to this paper. 

2 Odd-Cycle-Freeness and the Fourier Spectrum 

Our goal in this section is to give two reformulations of OCF, one of a geometric flavor and one 
in terms of the coefficients of the Fourier expansion of /. These characterizations of OCF will be 
useful in the analysis of both the edge-sampling test and the subspace restriction test which will be 
given in later sections. We begin by recalling some basic facts about Fourier analysis of Boolean 
functions. 

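The orthonormal characters $\{\chi_\alpha : \mathbb{F}_2^n \to \mathbb{R},\ \chi_\alpha(x) = (-1)^{\alpha \cdot x}\}_{\alpha \in \mathbb{F}_2^n}$ form a basis for the space of real-valued functions defined over $\mathbb{F}_2^n$ (in particular, every $\{0,1\}$-valued function can be expanded in them), where the inner product is given by $\langle f, g \rangle = \mathbb{E}_x[f(x)g(x)]$. The Fourier coefficient of $f$ at $\alpha \in \mathbb{F}_2^n$ is $\hat{f}(\alpha) = \mathbb{E}_x[f(x)\chi_\alpha(x)]$. The density of $f$ is $\rho = \mathbb{E}_x f(x) = \hat{f}(0)$, and notice that $\rho = \max_{\alpha \in \mathbb{F}_2^n} |\hat{f}(\alpha)|$. The support of $f$ is $\mathrm{supp}(f) = \{x \in \mathbb{F}_2^n \mid f(x) \neq 0\}$.

Parseval's identity states that $\sum_\alpha \hat{f}(\alpha)^2 = \mathbb{E}_x f(x)^2 = \rho$. Also, for all $\alpha$, $\hat{f}(\alpha) \geq \max(-\rho, -1/2)$.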

We first notice that the presence of cycles in a function induces a certain distribution of the 
density of the function on halfspaces. 

Claim 7 Let $f : \mathbb{F}_2^n \to \{0,1\}$. Then:

(a) $f$ is OCF if and only if there exists $\alpha \in \mathbb{F}_2^n$ such that for all $x \in \mathrm{supp}(f)$, $\alpha \cdot x = 1$ (i.e., there exists a linear subspace of dimension $n - 1$ that does not contain any element of $\mathrm{supp}(f)$).

(b) $f$ is $\epsilon$-far from OCF if and only if for every $\alpha \in \mathbb{F}_2^n$, it holds that for at least $\epsilon 2^n$ many $x \in \mathrm{supp}(f)$, $\alpha \cdot x = 0$ (i.e., every linear subspace of dimension $n - 1$ must contain at least $\epsilon 2^n$ elements of $\mathrm{supp}(f)$).

Proof We first prove part (a). To see the "if" direction, suppose $f$ is not odd-cycle-free, but there exists $\alpha$ such that for all $x \in \mathrm{supp}(f)$, $\alpha \cdot x = 1$. Now, let $x_1, \dots, x_{k-1} \in \mathrm{supp}(f)$ be such that $x_1 + \cdots + x_{k-1} \in \mathrm{supp}(f)$ and $k$ is odd. But then $\alpha \cdot (x_1 + \cdots + x_{k-1}) = \sum_i \alpha \cdot x_i = 0$, since $k - 1$ is even, which contradicts $\alpha \cdot x = 1$ on the support.

For the opposite direction, suppose $f$ is odd-cycle-free. If $f$ is the zero function, we are vacuously done. Assuming otherwise, let $S = \mathrm{supp}(f)$, and consider the set $H' = \{x_1 + \cdots + x_k : x_1, \dots, x_k \in S, \text{ and } k \geq 0 \text{ is even}\}$. Since $f$ is OCF, $H' \cap S = \emptyset$. It is easy to see that $H'$ is a linear subspace of codimension 1 inside $\mathrm{span}(S)$. It follows that $H'$ can be extended to a subspace $H$ of dimension $n - 1$ such that $H \cap S = \emptyset$.

For part (b), if $f$ is $\epsilon$-far from being OCF, then by part (a), every linear subspace $H$ of dimension $n - 1$ must contain at least $\epsilon 2^n$ elements of $\mathrm{supp}(f)$ (otherwise removing fewer than $\epsilon 2^n$ points from $\mathrm{supp}(f)$ would create a function that is OCF). The converse follows again by part (a). □
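Claim 7(a) can be checked by brute force on small examples. The sketch below is illustrative and rests on assumptions not made in the paper: vectors are bitmask-encoded integers, $f$ is given by its support, and the function names are hypothetical. The first function searches directly for an odd-length list of support points summing to zero by exploring pairs (running sum, parity of the number of terms); the second looks for the vector $\alpha$ promised by the claim.

```python
def dot(a, x):
    """Inner product over F_2 of two bitmask-encoded vectors."""
    return bin(a & x).count("1") % 2

def is_ocf_by_definition(support):
    """Search for an odd number of support points that XOR to zero.

    A state (v, p) records a reachable running sum v together with the
    parity p of the number of terms used; f fails to be OCF exactly
    when the state (0, 1) is reachable.
    """
    seen = {(0, 0)}
    frontier = [(0, 0)]
    while frontier:
        v, p = frontier.pop()
        for s in support:
            nxt = (v ^ s, p ^ 1)
            if nxt == (0, 1):
                return False
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(nxt)
    return True

def is_ocf_by_claim7(support, n):
    """Claim 7(a): OCF iff some alpha has alpha . x = 1 on the whole support."""
    return any(all(dot(a, x) == 1 for x in support) for a in range(2 ** n))
```

For $n = 3$, the support $\{011, 101, 110\}$ sums to zero and fails both checks, while $\{001, 011, 111\}$ passes both with $\alpha = 001$.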

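Lemma 8 Let $f : \mathbb{F}_2^n \to \{0,1\}$. Then

(a) $f$ is OCF if and only if there exists $\alpha \in \mathbb{F}_2^n$ such that $\hat{f}(\alpha) = -\rho$.

(b) $f$ is $\epsilon$-far from being OCF if and only if for all $\beta \in \mathbb{F}_2^n$, $\hat{f}(\beta) \geq -\rho + 2\epsilon$.

(c) The distance of $f$ from OCF is exactly $\frac{1}{2}\left(\rho + \min_\alpha \hat{f}(\alpha)\right)$.

Proof By Claim 7, $f$ is OCF if and only if there exists $\alpha$ such that $\alpha \cdot x = 1$ for all $x \in \mathrm{supp}(f)$, and so it follows that

$$\hat{f}(\alpha) = \mathbb{E}_x\left[f(x)(-1)^{\alpha \cdot x}\right] = -\rho.$$

This implies item (a) of the lemma. To derive item (b), we get from Claim 7 that $f$ is $\epsilon$-far from OCF if and only if any halfspace contains at least $\epsilon 2^n$ points of the support, implying that for each $\beta \in \mathbb{F}_2^n$:

$$\hat{f}(\beta) = \mathbb{E}_x\left[f(x)\chi_\beta(x)\right] = \frac{1}{2^n}\Bigg(\sum_{\substack{x \in \mathrm{supp}(f) \\ \beta \cdot x = 0}} 1 + \sum_{\substack{x \in \mathrm{supp}(f) \\ \beta \cdot x = 1}} (-1)\Bigg) \geq \epsilon + (-\rho + \epsilon) = -\rho + 2\epsilon.$$

Finally, observe that item (c) is just a restatement of item (b). □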
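Item (c) is easy to sanity-check numerically for very small $n$. The sketch below is only an illustration under assumptions of our own: vectors are bitmask-encoded, $f$ is given by its support, exact arithmetic uses `fractions.Fraction`, and all function names are hypothetical. It compares the formula $\frac{1}{2}(\rho + \min_\alpha \hat{f}(\alpha))$ with an exhaustive search over all functions $g$ on $\mathbb{F}_2^n$, which is feasible only for $n \leq 3$ or so.

```python
from fractions import Fraction

def chi(a, x):
    """Character chi_a(x) = (-1)^(a . x) for bitmask-encoded vectors."""
    return -1 if bin(a & x).count("1") % 2 else 1

def fourier_coefficient(support, n, a):
    """hat f(a) = E_x[f(x) chi_a(x)] for the indicator f of `support`."""
    return Fraction(sum(chi(a, x) for x in support), 2 ** n)

def distance_to_ocf_formula(support, n):
    """Lemma 8(c): (rho + min_a hat f(a)) / 2."""
    rho = Fraction(len(support), 2 ** n)
    min_coeff = min(fourier_coefficient(support, n, a) for a in range(2 ** n))
    return (rho + min_coeff) / 2

def is_ocf(support, n):
    """Claim 7(a): some a has chi_a(x) = -1 on the whole support."""
    return any(all(chi(a, x) == -1 for x in support) for a in range(2 ** n))

def distance_to_ocf_exhaustive(support, n):
    """Minimum Hamming distance to any OCF function, by enumeration."""
    size = 2 ** n
    f = set(support)
    best = size
    for mask in range(2 ** size):
        g = {x for x in range(size) if (mask >> x) & 1}
        if is_ocf(g, n):
            best = min(best, len(f ^ g))  # symmetric difference = disagreements
    return Fraction(best, size)
```

For the support $\{011, 101, 110\}$ in $\mathbb{F}_2^3$, the density is $3/8$ and the minimum Fourier coefficient is $-1/8$, so both functions return $1/8$: deleting a single point destroys the only odd cycle.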

We see that the minimum Fourier coefficient of $f$ determines its distance from OCF. Since Fourier coefficients also measure correlation to linear functions, it is natural to ask about the relationship between a function's distance to OCF and its distance to linearity^4. Easy Fourier analysis shows that the distance of a function $f : \mathbb{F}_2^n \to \mathbb{F}_2$ to linearity is exactly $\min\left(\rho, \frac{1}{2} + \min_\alpha \hat{f}(\alpha)\right)$. So, the distance to linearity, in contrast to OCF, is not always determined by the minimum Fourier coefficient.



3 The Edge-Sampling Test 

In this section, we analyze the edge-sampling test and prove Theorem 3. The analysis starts with 
the characterization of OCF given in the previous section and then proceeds to reduce the problem 
of testing OCF for Boolean functions to testing bipartiteness in dense graphs. We then show why 
Theorem 4 follows. 

For a function $f : \mathbb{F}_2^n \to \{0,1\}$, define the Cayley graph $\mathcal{G}(f) = (V, E)$ to be the graph with vertex set $V = \mathbb{F}_2^n$ and edge set $E = \{(x,y) \mid f(x - y) = 1\}$. Let us denote $N = 2^n$ and let $A_{\mathcal{G}}$ be the adjacency matrix of $\mathcal{G}$. The next lemma is well-known but we include its proof for the sake of completeness.

Lemma 9 For any $\alpha \in \mathbb{F}_2^n$, the character $\chi_\alpha$ is an eigenvector of $A_{\mathcal{G}}$ of normalized eigenvalue $\hat{f}(\alpha)$. Moreover, the set $\{2^n \hat{f}(\alpha)\}_\alpha$ is exactly the set of all the eigenvalues of $A_{\mathcal{G}}$.

^4 A function $f : \mathbb{F}_2^n \to \mathbb{F}_2$ is said to be linear if $f(x + y) = f(x) + f(y)$ for all $x, y$ (the range $\{0,1\}$ has been identified with $\mathbb{F}_2$). Note that linear functions are OCF. However, the converse is certainly false, since the function $f(x) = x_1 x_2$ is OCF but 1/4-far from linear.

Proof Notice that the entry indexed by $x_i$ in $b = A\chi_\alpha$ is

$$b_i = \sum_{y \in \mathbb{F}_2^n} f(x_i - y)\chi_\alpha(y) = \chi_\alpha(x_i) \sum_{x \in \mathbb{F}_2^n} f(x)\chi_\alpha(x) = \chi_\alpha(x_i) \cdot 2^n \hat{f}(\alpha).$$

Therefore, $A\chi_\alpha = (2^n \hat{f}(\alpha))\chi_\alpha$, and since the set of characters contains $2^n$ orthogonal vectors, the lemma follows. □
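Lemma 9 can be verified directly on small instances without any linear-algebra library. The sketch below is illustrative only; the bitmask encoding of vectors and the function names are assumptions of ours. For each $\alpha$ it multiplies the adjacency matrix of $\mathcal{G}(f)$ by $\chi_\alpha$ row by row and compares the result with $2^n \hat{f}(\alpha)\chi_\alpha$.

```python
def chi(a, x):
    """Character chi_a(x) = (-1)^(a . x) for bitmask-encoded vectors."""
    return -1 if bin(a & x).count("1") % 2 else 1

def characters_are_eigenvectors(support, n):
    """Check A chi_a = (2^n hat f(a)) chi_a for every a in F_2^n.

    A is the adjacency matrix of the Cayley graph of the indicator of
    `support`: A[x][y] = 1 exactly when x - y (= x XOR y) is in the support.
    """
    size = 2 ** n
    s = set(support)
    for a in range(size):
        eigenvalue = sum(chi(a, x) for x in s)  # equals 2^n * hat f(a)
        for x in range(size):
            row_times_chi = sum(chi(a, y) for y in range(size) if (x ^ y) in s)
            if row_times_chi != eigenvalue * chi(a, x):
                return False
    return True
```

Because the check is exact integer arithmetic, it returns `True` for every support, in line with the lemma.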

We remind the reader that in the context of testing graph properties, a graph is $\epsilon$-far from being bipartite if one needs to remove at least $\epsilon n^2$ edges in order to make it odd-cycle-free. In order to be able to apply results concerning testing odd-cycle-freeness in graphs we will have to prove that $\mathcal{G}(f)$ is in fact $\epsilon$-far from any bipartite graph. To this end, we show the following lemma that relates the distance to being bipartite to the least eigenvalue of the adjacency matrix. In what follows, we denote by $e(S)$ the number of edges inside a set of vertices $S$ in some graph $G$.

Lemma 10 Let $\lambda_{\min}$ be the smallest eigenvalue of the adjacency matrix $A$ of an $n$-vertex $d$-regular graph $G$. Then for every $U \subseteq V(G)$, we have

$$e(U) \geq \frac{|U|}{2n}\Big(|U| d + \lambda_{\min}(n - |U|)\Big). \qquad (1)$$

Proof Let $u$ be the indicator vector of $U$. We clearly have

$$u^T A u = 2e(U).$$

Since $A$ is symmetric it has a collection of eigenvectors $v_1, \dots, v_n$ which form an orthonormal basis for $\mathbb{R}^n$. Let $\lambda_1, \dots, \lambda_n$ be the eigenvalues corresponding to these eigenvectors, where $\lambda_n = \lambda_{\min}$. Suppose we can write $u = \sum_{i=1}^n \alpha_i v_i$ in this basis and note that $\sum_{i=1}^n \alpha_i^2 = |U|$. Since $G$ is $d$-regular, $(1/\sqrt{n}, \dots, 1/\sqrt{n})$ is an eigenvector of $A$. Suppose this is $v_1$ and note that this means that $\lambda_1 = d$ and $\alpha_1 = |U|/\sqrt{n}$. Combining the above observations we see that

$$u^T A u = \sum_{i=1}^n \lambda_i \alpha_i^2 = \frac{d|U|^2}{n} + \sum_{i=2}^n \lambda_i \alpha_i^2 \geq \frac{d|U|^2}{n} + \lambda_{\min}\Big(\sum_{i=2}^n \alpha_i^2\Big) = \frac{d|U|^2}{n} + \lambda_{\min}\Big(|U| - \frac{|U|^2}{n}\Big).$$

We now get (1) by combining the above two expressions for $u^T A u$. □
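Inequality (1) can be spot-checked on Cayley graphs, where Lemma 9 gives the smallest eigenvalue for free. The sketch below is an illustration under assumptions of ours: vectors are bitmask-encoded, the zero vector is dropped from the support so that the graph has no loops, and the function name is hypothetical. It enumerates every vertex subset $U$, so it is meant for $n \leq 3$ only.

```python
def chi(a, x):
    """Character chi_a(x) = (-1)^(a . x) for bitmask-encoded vectors."""
    return -1 if bin(a & x).count("1") % 2 else 1

def lemma10_holds_on_cayley_graph(support, n):
    """Check inequality (1) for every vertex subset U of the Cayley graph."""
    size = 2 ** n
    s = set(support) - {0}  # drop 0 so that the graph has no loops
    d = len(s)              # the Cayley graph is d-regular
    # By Lemma 9 the eigenvalues are the sums of chi_a over the support.
    lam_min = min(sum(chi(a, x) for x in s) for a in range(size))
    for mask in range(2 ** size):
        u = [x for x in range(size) if (mask >> x) & 1]
        e_u = sum(1 for i, x in enumerate(u) for y in u[i + 1:] if (x ^ y) in s)
        # Inequality (1), multiplied through by 2 * size to stay in integers.
        if 2 * size * e_u < len(u) * (len(u) * d + lam_min * (size - len(u))):
            return False
    return True
```

For the support $\{011, 101, 110\}$ the graph is two disjoint copies of $K_4$ with $\lambda_{\min} = -1$, and taking $U$ to be the whole vertex set gives equality in (1).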

Corollary 11 Let $G$ be an $n$-vertex $d$-regular graph with $\lambda_{\min} \geq -d + 2\epsilon n$. Then $G$ is $\epsilon/2$-far from being bipartite.

Proof It is clearly enough to show that in any bipartition of the vertices of $G$ into sets $A, B$ we have $e(A) + e(B) \geq \frac{1}{2}\epsilon n^2$. So let $(A, B)$ be one such bipartition and suppose $|A| = cn$ and $|B| = (1 - c)n$. From Lemma 10 we get that

$$e(A) \geq \frac{c}{2}\Big(dcn + (-d + 2\epsilon n)(n - cn)\Big) = \frac{c}{2}(2\epsilon n^2 - dn) + \frac{c^2}{2}(2dn - 2\epsilon n^2),$$

and similarly

$$e(B) \geq \frac{1-c}{2}(2\epsilon n^2 - dn) + \frac{(1-c)^2}{2}(2dn - 2\epsilon n^2).$$

Hence

$$e(A) + e(B) \geq \frac{1}{2}(2\epsilon n^2 - dn) + \frac{1}{2}\big(c^2 + (1-c)^2\big)(2dn - 2\epsilon n^2) \geq \frac{1}{2}(2\epsilon n^2 - dn) + \frac{1}{4}(2dn - 2\epsilon n^2) = \frac{1}{2}\epsilon n^2,$$

where in the second inequality we use the fact that $c^2 + (1 - c)^2$ is minimized when $c = 1/2$. □

We can now derive the following exact relation between the OCF property of functions and the bipartiteness of the corresponding Cayley graphs.

Corollary 12 Let $f : \mathbb{F}_2^n \to \mathbb{F}_2$. If $f$ is $\epsilon$-far from being OCF, then $\mathcal{G}(f)$ is $\epsilon/2$-far from being bipartite. Furthermore, if $f$ is $\epsilon$-close to being OCF, then $\mathcal{G}(f)$ is $\epsilon/2$-close to being bipartite.

Proof Suppose $f$ is $\epsilon$-far from being OCF. Let us suppose $|\mathrm{supp}(f)| = \rho N$ where $N = 2^n$. By Lemmas 8 and 9, if $f$ is $\epsilon$-far from being OCF, then the smallest eigenvalue of the adjacency matrix of $\mathcal{G}(f)$ is $\lambda_{\min} \geq (-\rho + 2\epsilon)N$. For a function $f$, recall that $\mathcal{G}(f)$ is a regular graph with degree $d = |\mathrm{supp}(f)| = \rho N$. Hence, we can use Corollary 11 to infer that $\mathcal{G}(f)$ must be $\epsilon/2$-far from being bipartite.

Suppose now that $f$ is $\epsilon$-close to being OCF, and let $S$ be the set of at most $\epsilon N$ points in $\mathbb{F}_2^n$ whose removal from the support of $f$ makes it OCF. Call this new function $f'$. Observe that every $x \in \mathbb{F}_2^n$ for which $f(x) = 1$ accounts for $N/2$ edges in $\mathcal{G}(f)$. Hence, removing from $\mathcal{G}(f)$ all edges corresponding to $S$ results in the removal of at most $\frac{1}{2}\epsilon N^2$ edges. To finish the proof we just need to show that the new graph $\mathcal{G}(f')$ (note that the new graph is indeed the Cayley graph of the new function $f'$) does not contain any odd cycle. Suppose to the contrary that it contains an odd cycle $a_1, \dots, a_k, a_1$. For $1 \leq i \leq k$ set $x_i = a_{i+1} - a_i$, where $a_{k+1} = a_1$. Then by definition of $\mathcal{G}(f')$ we have $f'(x_1) = \cdots = f'(x_k) = 1$. Furthermore, as $a_1 + \sum_{i=1}^k x_i = a_1$ (since we have a cycle in $\mathcal{G}(f')$) we get that $\sum_{i=1}^k x_i = 0$, so $x_1, \dots, x_k$ is an odd cycle in $f'$, contradicting the assumption that $f'$ is OCF. □

We are now ready to complete the proof of Theorem 3 using the following result of Alon and Krivelevich [AK02].
Theorem 13 ([AK02]) Suppose a graph $G$ is $\epsilon$-far from being bipartite. Then, a random subset of vertices of $V(G)$ of size $O(1/\epsilon)$ spans a non-bipartite graph with probability at least 3/4.

Proof of Theorem 3 First, if $f$ is OCF then the test will clearly accept $f$ (with probability 1). Suppose now that $f$ is $\epsilon$-far from being OCF. Then by Corollary 12 we get that $\mathcal{G}(f)$ is $\epsilon/2$-far from being bipartite. Now notice that we can think of the points $\alpha_1, \dots, \alpha_k \in \mathbb{F}_2^n$ sampled by the edge-sampling test as vertices sampled from $\mathcal{G}(f)$. By Theorem 13, with probability at least 3/4, the vertices $\alpha_1, \dots, \alpha_k$ span an odd cycle of $\mathcal{G}(f)$. We claim that if this event happens, then the edge-sampling test will find an odd cycle in $f$. Indeed, if $\alpha_1, \dots, \alpha_k, \alpha_1$ is an odd cycle in $\mathcal{G}(f)$, then as in the proof of Corollary 12 this means that $\alpha_2 - \alpha_1, \alpha_3 - \alpha_2, \dots, \alpha_1 - \alpha_k$ is an odd cycle of $f$. Finally, the edge-sampling test will find this odd cycle, since it queries $f$ on all points $\alpha_i - \alpha_j$. □

In order to obtain Theorem 4, observe that by Corollary 12, the distance to OCF for a function $f$ is exactly double the distance to bipartiteness for the graph $\mathcal{G}(f)$. We now invoke the following result of Alon, de la Vega, Kannan and Karpinski [AdlVKK03], which improved upon a previous result of Goldreich, Goldwasser and Ron [GGR98].

Theorem 14 ([AdlVKK03]) For every $\epsilon > 0$, there exists an algorithm that, given input graph
G, inspects a random subgraph of G on 0(l/e^) vertices and estimates the distance from G to 
bipartiteness to within an additive error of e. 

Proof of Theorem 4 Combining Theorem 14 with Corollary 12 we immediately obtain a $\mathrm{poly}(1/\epsilon)$ query algorithm that estimates the distance to odd-cycle-freeness with additive error at most $\epsilon$. Since one can use sampling to estimate $\rho$ to within an additive error $\epsilon$ using $\mathrm{poly}(1/\epsilon)$ queries, it follows from item (c) of Lemma 8 that one can estimate $\min_\alpha \hat{f}(\alpha)$ to within an additive error of $\epsilon$ using $\mathrm{poly}(1/\epsilon)$ queries. □

As we have mentioned earlier, the distance of $f$ from being linear is given by $\min\left(\rho, \frac{1}{2} + \min_\alpha \hat{f}(\alpha)\right)$, where $\rho$ is the density of $f$. Therefore, given an estimate of $\rho$ and $\min_\alpha \hat{f}(\alpha)$ for some function $f : \mathbb{F}_2^n \to \mathbb{F}_2$, one can also estimate the distance of $f$ to linearity. Theorem 4 thus gives a new distance estimator for linearity, and hence also a two-sided tester for the property of linearity, both with $\mathrm{poly}(1/\epsilon)$ query complexity.
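The formula for the distance to linearity can be checked against a direct search over all $2^n$ linear functions $x \mapsto a \cdot x$. The sketch below is illustrative only; the bitmask encoding, the use of `fractions.Fraction` and the function names are assumptions of ours. It computes exact quantities by brute force rather than the sampled estimates that Theorem 4 provides.

```python
from fractions import Fraction

def dot(a, x):
    """Inner product over F_2 of two bitmask-encoded vectors."""
    return bin(a & x).count("1") % 2

def distance_to_linearity_formula(support, n):
    """min(rho, 1/2 + min_a hat f(a)) for the indicator f of `support`."""
    size = 2 ** n
    rho = Fraction(len(support), size)
    min_coeff = min(
        Fraction(sum((-1) ** dot(a, x) for x in support), size)
        for a in range(size)
    )
    return min(rho, Fraction(1, 2) + min_coeff)

def distance_to_linearity_exhaustive(support, n):
    """Minimum over a of the fraction of x with f(x) != a . x."""
    size = 2 ** n
    s = set(support)
    return min(
        Fraction(sum(1 for x in range(size) if (x in s) != (dot(a, x) == 1)), size)
        for a in range(size)
    )
```

For $f(x) = x_1 x_2$ on $\mathbb{F}_2^2$ (support $\{11\}$) both functions return $1/4$, matching the remark that this function is OCF but 1/4-far from linear.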



4 The Subspace Restriction Test 

In this section, we analyze the subspace restriction test and prove Theorem 6. We start with a few notational remarks. For $f : \mathbb{F}_2^n \to \{0,1\}$ and subspace $H \leq \mathbb{F}_2^n$, let $f_H : H \to \{0,1\}$ be the restriction of $f$ to $H$, and let $\rho_H$ denote the density of $f_H$, namely $\rho_H = \Pr_{x \in H}[f_H(x) = 1]$. For $\alpha \in \mathbb{F}_2^n$ and subspace $H$, define the restriction of the Fourier coefficients of $f$ to a subspace $H$ to be

$$\hat{f}_H(\alpha) = \mathbb{E}_{x \in H}\left[f(x)\chi_\alpha(x)\right].$$

Recall that the character group of $H$ is isomorphic to $H$ itself, and so $f_H = \sum_{\alpha \in H} \hat{f}_H(\alpha)\chi_\alpha$. The dual of $H$ is the subspace $H^\perp = \{x \in \mathbb{F}_2^n \mid \langle x, a \rangle = 0\ \forall a \in H\}$. Note that $\hat{f}_H(\alpha) = \hat{f}_H(\beta)$ whenever $\alpha \in \beta + H^\perp$. The convolution of $f, g : \mathbb{F}_2^n \to \mathbb{R}$ is $f * g : \mathbb{F}_2^n \to \mathbb{R}$, $(f * g)(c) = \mathbb{E}_{x \in \mathbb{F}_2^n}[f(x + c)g(x)]$.

It is known that $\widehat{f * g} = \hat{f} \cdot \hat{g}$. In what follows, we will let $h$ be the size of the subspace $H$.

The strategy of the proof is to use Lemma 8 and reduce the analysis to showing that if every nonzero Fourier coefficient of $f$ is at least $-\rho + 2\epsilon$, then for a random linear subspace $H$, with probability 2/3, every nonzero Fourier coefficient of $f_H$ is strictly greater than $-\rho_H$ (where, again, $f_H : H \to \{0,1\}$ is the restriction of $f$ to $H$ and $\rho_H$ is the density of $f_H$).

A useful insight into why this should be true is that the restricted Fourier coefficients are concentrated around the non-restricted counterparts, deviating from them by an amount essentially inversely proportional to the size of the subspace $H$. A direct union bound type argument is however too weak to give anything interesting when the size of $H$ is small. The idea of our proof is to separately analyze the restrictions of the large and small coefficients. Understanding the restrictions of the small coefficients is the more difficult part of the argument, and the crux of the proof relies on noticing that the moments of the Fourier coefficients are also preserved under restrictions to subspaces. In particular, an analysis of the deviation of the fourth moment implies that one can balance the parameters involved so that even when $H$ is of size only $\mathrm{poly}(1/\epsilon)$, no restriction of the coefficients of low magnitude can become as small as $-\rho_H$.

We first show that the restriction of / to a random linear subspace does not change an individual 
Fourier coefficient by more than a small additive term dependent on the size of the subspace. This 
follows from standard Chebyshev-type concentration bounds. 



Lemma 15 




Proof 



Now, consider the deviation of Ei/[//f(Q;)] from f{a): 



mfH{a)] = E E [f{x)xa{x)] 




>f{(^)-^-T> /(«) - r 
- J\ J 2" h ~ ^ ^ h 



Similarly: 



E[/^(a)] < /(a) + I 



So, it suffices to show that $\Pr_H\left[\,|\hat{f}_H(\alpha) - \mathbb{E}_H[\hat{f}_H(\alpha)]| > \eta\,\right] < 10/(h\eta^2)$. We prove this by bounding the variance of $\hat{f}_H(\alpha)$.
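$$\mathbb{E}_H\left[\hat{f}_H(\alpha)^2\right] = \mathbb{E}_H\, \mathbb{E}_{x,y \in H}\left[f(x)f(y)\chi_\alpha(x)\chi_\alpha(y)\right]$$

$$\leq \mathbb{E}_H\Bigg[\Pr_{x,y \in H}\left[\dim(\mathrm{span}(x,y)) < 2\right] + \mathbb{E}_{\substack{x,y \in H \\ \dim(\mathrm{span}(x,y)) = 2}}\left[f(x)f(y)\chi_\alpha(x)\chi_\alpha(y)\right]\Bigg]$$

$$\leq \frac{3}{h} + \mathbb{E}_{\substack{x,y \in \mathbb{F}_2^n \\ \dim(\mathrm{span}(x,y)) = 2}}\left[f(x)f(y)\chi_\alpha(x)\chi_\alpha(y)\right].$$

Hence $\mathrm{Var}[\hat{f}_H(\alpha)] \leq \frac{10}{h}$, and the lemma now follows by Chebyshev's inequality. □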



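As was the case with the restricted coefficients, a straightforward variance calculation shows 
that the fourth moment is also preserved up to a small additive error upon restriction 
to a random $H$, when $h$ is large enough. For that purpose, define $A$ and $A_H$ as follows: 

\[ A = \sum_{\alpha \in \mathbb{F}_2^n} \hat{f}(\alpha)^4 = \mathbb{E}_{x_1,x_2,x_3 \in \mathbb{F}_2^n}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right], \]

\[ A_H = \sum_{\alpha \in H} \hat{f}_H(\alpha)^4 = \mathbb{E}_{x_1,x_2,x_3 \in H}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right]. \]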
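The identity between the fourth moment of the Fourier coefficients and the count of additive quadruples can be checked by brute force on small functions. The sketch below assumes the standard normalization $\hat{f}(\alpha) = \mathbb{E}_x[f(x)(-1)^{\alpha \cdot x}]$ and uses exact rational arithmetic; the function names are hypothetical.

```python
from fractions import Fraction

def dot(a, x):
    """Inner product over F_2 of two bit-vectors encoded as ints."""
    return bin(a & x).count("1") & 1

def fourier(f, n):
    """Exact coefficients f_hat(a) = E_x[f(x) * (-1)^(a.x)] for all a."""
    N = 1 << n
    return [Fraction(sum(f[x] * (-1) ** dot(a, x) for x in range(N)), N)
            for a in range(N)]

def fourth_moment_fourier(f, n):
    """Sum over all a of f_hat(a)^4."""
    return sum(c ** 4 for c in fourier(f, n))

def fourth_moment_direct(f, n):
    """E over x1, x2, x3 of f(x1) f(x2) f(x3) f(x1 + x2 + x3)."""
    N = 1 << n
    total = sum(f[x1] * f[x2] * f[x3] * f[x1 ^ x2 ^ x3]
                for x1 in range(N) for x2 in range(N) for x3 in range(N))
    return Fraction(total, N ** 3)
```

Both functions return the same rational number for any 0/1-valued table `f` of length `2**n`, so either side can be used to estimate $A$ or, restricted to a subspace, $A_H$.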



Then, we have: 

Lemma 16 

\[ \Pr_H\left[ |A_H - A| \ge \frac{16}{h} + \eta \right] \le \frac{500}{h\eta^2}. \]



Proof As in the proof of Lemma 15, our strategy will be to first show that $\mathbb{E}_H[A_H]$ is 
close to $A$ and then to bound the variance of $A_H$. 



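Claim 17 $\left|\mathbb{E}_H[A_H] - A\right| \le 16/h$. 

Proof 

\[ \begin{aligned}
\mathbb{E}_H[A_H]
&= \mathbb{E}_H \mathbb{E}_{x_1,x_2,x_3 \in H}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right] \\
&\ge \mathbb{E}_H\left[ \left(1 - \frac{8}{h}\right) \mathbb{E}_{\substack{x_1,x_2,x_3 \in H \\ \dim(\mathrm{span}(x_1,x_2,x_3)) = 3}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right] \right] \\
&\ge \mathbb{E}_H\left[ \mathbb{E}_{\substack{x_1,x_2,x_3 \in H \\ \dim(\mathrm{span}(x_1,x_2,x_3)) = 3}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right] \right] - \frac{8}{h} \\
&= \mathbb{E}_{\substack{x_1,x_2,x_3 \in \mathbb{F}_2^n \\ \dim(\mathrm{span}(x_1,x_2,x_3)) = 3}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)\right] - \frac{8}{h} \\
&\ge A - \frac{16}{h},
\end{aligned} \]

and similarly $\mathbb{E}_H[A_H] \le A + 16/h$. □ 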



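Claim 18 $\operatorname{Var}[A_H] \le 500/h$. 

Proof 

\[ \begin{aligned}
\mathbb{E}_H[A_H^2]
&= \mathbb{E}_H \mathbb{E}_{x_1,x_2,x_3,y_1,y_2,y_3 \in H}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)f(y_1)f(y_2)f(y_3)f(y_1+y_2+y_3)\right] \\
&\le \mathbb{E}_H\Big[ \Pr_{x_1,\dots,y_3 \in H}[\dim(\mathrm{span}(\{x_1,x_2,x_3,y_1,y_2,y_3\})) < 6] \\
&\qquad\quad + \mathbb{E}_{\substack{x_1,x_2,x_3,y_1,y_2,y_3 \in H \\ \dim(\mathrm{span}(\{x_1,x_2,x_3,y_1,y_2,y_3\})) = 6}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)f(y_1)f(y_2)f(y_3)f(y_1+y_2+y_3)\right] \Big] \\
&\le \frac{64}{h} + \mathbb{E}_H \mathbb{E}_{\substack{x_1,x_2,x_3,y_1,y_2,y_3 \in H \\ \dim(\mathrm{span}(\{x_1,x_2,x_3,y_1,y_2,y_3\})) = 6}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)f(y_1)f(y_2)f(y_3)f(y_1+y_2+y_3)\right] \\
&\le \frac{64}{h} + \mathbb{E}_{\substack{x_1,x_2,x_3,y_1,y_2,y_3 \in \mathbb{F}_2^n \\ \dim(\mathrm{span}(\{x_1,x_2,x_3,y_1,y_2,y_3\})) = 6}}\left[f(x_1)f(x_2)f(x_3)f(x_1+x_2+x_3)f(y_1)f(y_2)f(y_3)f(y_1+y_2+y_3)\right].
\end{aligned} \]

The last expectation is at most $A^2 + 64/h$, so together with Claim 17, 

\[ \operatorname{Var}[A_H] = \mathbb{E}_H[A_H^2] - \left(\mathbb{E}_H[A_H]\right)^2 \le \frac{128}{h} + A^2 - \left(\mathbb{E}_H[A_H]\right)^2 \le \frac{500}{h}. \]

□ 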



The lemma now follows by Chebyshev's inequality. □ 

Using Lemmas 15 and 16, we can now proceed with the proof of Theorem 6. 



Proof of Theorem 6 If $f$ is $\epsilon$-far from odd-cycle-free, then $\rho \ge \epsilon$, and by Lemma 8, all its 
Fourier coefficients are $\ge -\rho + 2\epsilon$. We need to show that with constant probability over the random 
choice of $H$, each Fourier coefficient of $f_H$ is $> -\rho_H$. We separate these coefficients into the sets of 
large and small coefficients and analyze them separately. Define 

\[ L = \{\alpha : |\hat{f}(\alpha)| \ge \gamma\} \subseteq \mathbb{F}_2^n \quad \text{and} \quad S = \mathbb{F}_2^n \setminus L \]

for some $\gamma < \rho$ to be chosen later. Notice that $0 \in L$. Also, by Parseval's identity, $|L| \le 1/\gamma^2$. Let 
$L_H \subseteq H$ be the set of elements $\beta \in H$ such that there exists $\alpha \in L$ with $\beta \in \alpha + H^\perp$, that is, $\beta$ is 
the "projection" of some large coefficient. Then $|L_H| \le |L|$. Let $S_H = H \setminus L_H$ be the complement 
of $L_H$ in $H$. 
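The sets $L$ and $L_H$ can be computed explicitly for small examples. The following is a hedged sketch rather than the construction used in the proof: it represents $H$ by a basis, and identifies the projection of $\alpha$ with the vector of inner products of $\alpha$ with the basis elements, so that two coefficients project to the same element exactly when their difference lies in $H^\perp$. All function names are hypothetical.

```python
from fractions import Fraction

def dot(a, x):
    """Inner product over F_2 of two bit-vectors encoded as ints."""
    return bin(a & x).count("1") & 1

def fourier(f, n):
    """Exact coefficients f_hat(a) = E_x[f(x) * (-1)^(a.x)] for all a."""
    N = 1 << n
    return [Fraction(sum(f[x] * (-1) ** dot(a, x) for x in range(N)), N)
            for a in range(N)]

def large_set(f, n, gamma):
    """L = all a whose coefficient has magnitude at least gamma."""
    return [a for a, c in enumerate(fourier(f, n)) if abs(c) >= gamma]

def project(a, basis):
    """Label of the character of a restricted to H = span(basis).
    Two vectors get the same label iff their sum is orthogonal to H."""
    return tuple(dot(a, b) for b in basis)

def projected_large_set(L, basis):
    """L_H: the distinct projections of the large coefficients."""
    return {project(a, basis) for a in L}
```

For the parity function on $\mathbb{F}_2^4$ with $\gamma = 1/4$, the large set is $\{0, 1111\}$; a subspace whose basis vectors all have even weight collapses both elements to one projection, which is the collision event the proof later rules out with a union bound.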



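From Lemma 15, for each $\alpha \in L$ and for any $\eta_1 \in (0, 1)$, we have 

\[ \Pr_H\left[ \left|\hat{f}_H(\alpha) - \hat{f}(\alpha)\right| > \frac{2}{h} + \eta_1 \right] < \frac{10}{h\eta_1^2}. \]

By a union bound, with probability at least $1 - \frac{10}{h\gamma^2\eta_1^2}$, for every $\alpha \in L_H$, it holds that $\hat{f}_H(\alpha) \ge \hat{f}(\alpha) - \frac{2}{h} - \eta_1$. 
Moreover, since $0 \in L$, we know $|\rho_H - \rho| \le \frac{2}{h} + \eta_1$. If $2\eta_1 + \frac{4}{h} < 2\epsilon$, then for any $\alpha \in L_H$, we have 

\[ \hat{f}_H(\alpha) \ge \hat{f}(\alpha) - \frac{2}{h} - \eta_1 \ge -\rho + 2\epsilon - \frac{2}{h} - \eta_1 \ge -\rho_H + 2\epsilon - \frac{4}{h} - 2\eta_1 > -\rho_H \]

with probability at least $1 - \frac{10}{h\gamma^2\eta_1^2}$. 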

We now analyze the coefficients $\beta \in S_H$ and again show that with constant probability, no 
$\hat{f}_H(\beta)$ becomes as small as $-\rho_H$. As we described in the informal proof sketch earlier, for this, we 
will want to analyze the fourth moment of the Fourier coefficients. 

To this end, first observe that for any two Fourier coefficients $\alpha, \alpha' \in L$, their projections are 
identical if $\alpha - \alpha' \in H^\perp$. Over the random choice of $H$, this happens with probability at most $1/h$. 
Therefore, using a union bound, we conclude that with probability at least $1 - |L|^2/h \ge 1 - \frac{1}{\gamma^4 h}$, 
all the large Fourier coefficients project to distinct coefficients in $H$, namely $|L_H| = |L|$. Let us 
condition on this event that no two large Fourier coefficients in $L$ project to the same restricted 
coefficient. 

Let us also condition on the event that $|A - A_H| < \frac{16}{h} + \eta_2$ for some $\eta_2$ to be specified later. 
Also, condition on the event that for all $\alpha \in L$, $|\hat{f}_H(\alpha) - \hat{f}(\alpha)| \le \frac{2}{h} + \eta_1$. All of these events occur 
with probability at least $1 - \frac{500}{h\eta_2^2} - \frac{10}{h\gamma^2\eta_1^2} - \frac{1}{\gamma^4 h}$ by Lemmas 15 and 16. 



The following claim shows that the fourth moment of the small Fourier coefficients is also 
preserved under a random subspace restriction. 



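Claim 19 

\[ \left| \sum_{\alpha \in S_H} \hat{f}_H(\alpha)^4 - \sum_{\alpha \in S} \hat{f}(\alpha)^4 \right| \le \eta_2 + \frac{16}{h} + \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right). \]

Proof We have that 

\[ \begin{aligned}
\left| \sum_{\alpha \in L_H} \hat{f}_H(\alpha)^4 - \sum_{\alpha \in L} \hat{f}(\alpha)^4 \right|
&\le \sum_{\alpha \in L} \left| \hat{f}_H(\alpha)^4 - \hat{f}(\alpha)^4 \right| \\
&\le \sum_{\alpha \in L} \left| \hat{f}_H(\alpha) - \hat{f}(\alpha) \right| \left( \sum_{i=0}^{3} \hat{f}_H(\alpha)^i \hat{f}(\alpha)^{3-i} \right) \\
&\le 4 \cdot |L| \cdot \max_{\alpha \in L} \left| \hat{f}_H(\alpha) - \hat{f}(\alpha) \right| \\
&\le \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right).
\end{aligned} \]

It follows that 

\[ \begin{aligned}
\left| \sum_{\alpha \in S_H} \hat{f}_H(\alpha)^4 - \sum_{\alpha \in S} \hat{f}(\alpha)^4 \right|
&= \left| A_H - \sum_{\alpha \in L_H} \hat{f}_H(\alpha)^4 - \left( A - \sum_{\alpha \in L} \hat{f}(\alpha)^4 \right) \right| \\
&\le |A_H - A| + \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right) \\
&\le \eta_2 + \frac{16}{h} + \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right).
\end{aligned} \]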



□ 



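Now, on the one hand, we have: $\sum_{\alpha \in S} \hat{f}(\alpha)^4 \le \gamma^2 \sum_{\alpha} \hat{f}(\alpha)^2 \le \gamma^2$. On the other hand, 
$\max_{\alpha \in S_H} \hat{f}_H(\alpha)^4 \le \sum_{\alpha \in S_H} \hat{f}_H(\alpha)^4$. Therefore, combining and using Claim 19, we have: 

\[ \max_{\alpha \in S_H} \hat{f}_H(\alpha)^4 \le \gamma^2 + \eta_2 + \frac{16}{h} + \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right). \]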
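We need to choose the parameters such that $\max_{\alpha \in S_H} |\hat{f}_H(\alpha)| < \rho_H$, and so, it is enough to have: 

\[ \gamma^2 + \eta_2 + \frac{16}{h} + \frac{4}{\gamma^2}\left(\frac{2}{h} + \eta_1\right) < \left(\rho - \frac{2}{h} - \eta_1\right)^4. \]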



Additionally, we need to ensure that the events we have conditioned on occur with probability at 
least 2/3. So, we want: 

\[ \frac{500}{h\eta_2^2} + \frac{10}{h\gamma^2\eta_1^2} + \frac{1}{\gamma^4 h} \le \frac{1}{3}. \]

One can check now that the following setting of parameters satisfies both of the above con- 
straints: 7 = eVlOO, h = (10/e)20, r/i = (e/10)^, m = (e/10)^. □ 



5 Concluding Remarks and Open Problems 

• The main open question raised here (Question 5) is whether it is possible in general to obtain 
canonical testers for subspace-hereditary properties with only a polynomial blowup in the 
query complexity. Here, we show this to be true for OCF, and [BX10] showed the existence 
of a canonical tester with quadratic blowup for the triangle-freeness property. On the other 
hand, there is also some evidence to the contrary. Goldreich and Ron [GR11] proved 
a nontrivial gap between canonical and non-canonical testers for graph properties. They 
showed that there exist hereditary graph properties that can be tested using $\tilde{O}(\epsilon^{-1})$ queries 
but for which the canonical tester requires $\tilde{\Omega}(\epsilon^{-3/2})$ queries. Perhaps this indicates that for 
subspace-hereditary properties also, there is a nontrivial, maybe even super-polynomial in 
this case, gap between non-canonical and canonical testers. 

• As previously mentioned, OCF is in fact the only monotone property characterized by freeness 
from an infinite number of equations (of rank 1). We briefly comment here on the equivalence 
between all these properties. It is easy to see that even-length equations can be handled 
trivially. Suppose now that $\mathcal{P}$ is defined by freeness from all equations of length belonging to 
the infinite set of odd integers $S = \{k_1, k_2, \ldots\}$. Note that OCF $\subseteq \mathcal{P}$. Now suppose $k \notin S$ 
and $k$ is odd, and let $k'$ be the smallest element of $S$ such that $k < k'$. If $f \in \mathcal{P}$ is not free 
of solutions to the length $k$ equation, then $f$ is not free of solutions to the equation of length 
$k'$, since a solution $(x_1, \ldots, x_k)$ to the former induces a solution $(x_1, \ldots, x_k, x_1, x_1, \ldots, x_1)$ to 
the latter. 
• Another open problem that arises is to characterize the class of linear-invariant properties 
that can be tested using poly$(1/\epsilon)$ queries. For monotone properties that can be 
characterized by freeness from solutions to a family $\mathcal{F}$ of equations, we conjecture that there is a 
sharp dichotomy given by whether $\mathcal{F}$ is infinite or finite. It follows from Theorem 3 and the 
discussion in the previous item that when $\mathcal{F}$ is infinite, the query complexity is poly$(1/\epsilon)$. 
When $\mathcal{F}$ is finite and the property is nontrivial, then the property is equivalent to being 
free of solutions to a single equation $x_1 + \cdots + x_k = 0$ for some odd integer $k > 1$. In this 
case, we conjecture that the query complexity is super-polynomial, although the current best 
lower bound is only slightly nontrivial: $\Omega(1/\epsilon^{2.423})$ for testing triangle-freeness [BX10]. For 
non-monotone properties characterized by freeness from solutions to a family of equations, 
[CSX11] showed that $(C_3, 110)$-freeness can be tested using $O(1/\epsilon^2)$ queries (recalling the 
notation in Section 1), but there is no systematic understanding at present of when poly$(1/\epsilon)$ 
query complexity is possible for larger equations or for arbitrary intersections of such 
non-monotone properties. For properties characterized by freeness from solutions to a system of 
equations of rank greater than one, even less is known. 

• Another problem left open by our results is whether the $\tilde{O}(1/\epsilon^2)$ bound for 
odd-cycle-freeness is tight. This is indeed the case for bipartiteness testing in graphs [BT04], but a 
direct analogue of their hard instances does not seem to work in our case. 

• One could also ask Question 5 for linear-invariant properties that are not subspace-hereditary. 
Given a linear-invariant property $\mathcal{P}$, we say that a tester $T$ is canonical for $\mathcal{P}$ if there exists a 
fixed linear-invariant property $\mathcal{P}'$ (not necessarily the same as $\mathcal{P}$) such that when $T$ is given 
oracle access to a function $f : \mathbb{F}_2^n \to \{0, 1\}$, it operates by choosing uniformly at random a 
subspace $H \le \mathbb{F}_2^n$ and accepting if and only if $f$ restricted to $H$ satisfies the property $\mathcal{P}'$. 
Notice that unlike the subspace-hereditary case, the canonical tester now need not be 
one-sided. The stronger form of Question 5 is whether it is the case that for every linear-invariant 
property $\mathcal{P}$, there exists a canonical tester for $\mathcal{P}$ with query complexity poly$(q(n, \epsilon))$ whenever 
$\mathcal{P}$ is testable with query complexity $q(n, \epsilon)$ by some tester. Goldreich and Trevisan [GT03] 
showed the existence of such a canonical tester with polynomial blowup for graph properties. 
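The padding argument in the second remark above (a solution of odd length $k$ induces one of any larger odd length $k'$) can be sketched in a few lines. This is an illustration only: points of $\mathbb{F}_2^n$ are encoded as integers with XOR as addition, and `pad_solution` is a hypothetical helper name.

```python
from functools import reduce
from operator import xor

def pad_solution(xs, k_prime):
    """Extend a zero-sum tuple of odd length k to odd length k_prime >= k
    by appending copies of xs[0]. The number of appended copies is even,
    so they cancel in pairs and the sum stays zero; no new points are used."""
    k = len(xs)
    if k % 2 == 0 or k_prime % 2 == 0 or k_prime < k:
        raise ValueError("both lengths must be odd with k_prime >= k")
    if reduce(xor, xs, 0) != 0:
        raise ValueError("xs is not a solution: its sum is nonzero")
    return list(xs) + [xs[0]] * (k_prime - k)
```

Since every point of the padded tuple already appears in the original one, any function that takes the value 1 on the original solution also takes the value 1 on the padded solution.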